System and method for speech-enabled call routing

ABSTRACT

In a particular embodiment, a method includes receiving a communication from a source. The communication is associated with an account of the source. The method further includes determining, at a processor, whether a time period satisfies a threshold time period. The time period is associated with a voice message box having a particular status. The method further includes routing the communication to a first destination based on a determination that the time period satisfies the threshold time period.

CLAIM OF PRIORITY

This application is a continuation of, and claims priority from U.S. patent application Ser. No. 12/787,468, filed May 26, 2010, and entitled “SYSTEM AND METHOD FOR SPEECH-ENABLED CALL ROUTING,” which is a continuation of U.S. patent application Ser. No. 11/032,495, filed Jan. 10, 2005, now issued as U.S. Pat. No. 7,751,551, and entitled “SYSTEM AND METHOD FOR SPEECH-ENABLED CALL ROUTING,” each of which is hereby incorporated by reference in its entirety.

FIELD OF THE DISCLOSURE

The present disclosure relates generally to speech recognition systems and more particularly to speech recognition-enabled automatic call routing systems and methods.

BACKGROUND

Speech recognition systems are specialized computers that are configured to process and recognize human speech and may also take action or carry out further processes. Developments in speech recognition technologies support “natural language” type interactions between automated systems and users. A natural language interaction allows a person to speak naturally. Speech recognition systems can react responsively to a spoken request. An application of natural language processing is speech recognition with automatic call routing (ACR). A goal of an ACR application is to determine why a person is calling a service center and to route the caller to an appropriate agent or destination for servicing a request. Speech recognition technology generally allows an ACR application to recognize natural language statements so that the caller does not have to rely on a menu system. Natural language systems allow the caller to state the purpose of their call “in their own words.”

Natural language ACR systems attempt to interpret the intent of the caller based on the spoken language and then selects a routing destination. When a speech recognition system misinterprets the caller's intent, significant problems can result. A caller who is misrouted is generally unhappy. Misrouted callers realize that there has been a mistake with the automated phone service and often terminate the call by hanging up. If a caller does not hang up they will typically be routed to an operator who will attempt to manually route the call. Routing a caller to an undesired location and to human operators leads to considerable inefficiencies for a business. Most call routing systems handle a large volume of calls and, even if a small percentage of calls are abandoned, the costs associated with abandoned calls are significant. Accordingly, there is a need for an improved automated method and system of routing calls.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a simplified configuration of a telecommunication system;

FIG. 2 is a general diagram that illustrates a method of routing calls;

FIG. 3 is a flow diagram that illustrates a method of processing and routing calls;

FIG. 4 is a table that depicts speech input and mapped synonym terms; and

FIG. 5 is a table illustrating action-object pairs and call destinations relating to the action-object pairs.

DETAILED DESCRIPTION OF THE DRAWINGS

In a particular embodiment, a method includes receiving speech of a call from a caller at a processor of a call routing system. The method includes using the processor to determine a first call destination for the call based on the speech. The method includes using the processor to determine whether the caller is in compliance with at least one business rule related to an account of the caller. The method includes routing the call to the first call destination with the processor when the caller is in compliance with the at least one business rule. The method also includes routing the call to a second call destination with the processor when the caller is not in compliance with the at least one business rule.

In a particular embodiment, a method includes transforming speech of a call received from a caller into text using a processor. The method includes converting the text into an object and an action with the processor. The method includes determining, with the processor, a first call destination based on the object and the action. The method includes determining, with the processor, whether the caller is in compliance with a first business rule related to an account of the caller. The method also includes routing the call to a second call destination with the processor when the caller is not in compliance with the first business rule.

In a particular embodiment, a system including a processor and a network interface to receive speech input of a call from a caller. The system includes an action-object routing module executable by the processor to determine a first call destination based on the speech input. The system also includes a business rule module executable by the processor to determine whether the caller is in compliance with a business rule related to payment of an account of the caller. The system routes the call to a second call destination when the caller is not in compliance with the business rule and the system routes the call to the first call destination when the caller is in compliance with the business rule.

A method of processing a call is disclosed. The method can transform speech input from a caller of a call into text and convert the text into an object and an action. The method determines a call destination based on the object and the action. The method can route the call to a destination when a caller is not in compliance with at least one business rule. The method can further route the call to the call destination when the caller is in compliance.

In a particular embodiment, the system includes a call routing system that has a processor configured to convert speech input from a caller into an object and an action. The system has an action-object routing module configured to determine a first destination for the call based on the object and the action. The call routing system can route the call to a second destination when the caller is not in compliance with a business rule.

In another embodiment the communication system includes an acoustic model configured to accept speech input and to produce text and a semantic model coupled to the acoustic model for producing an action and an object responsive to the text. The system further includes a routing module table responsive to the semantic model to provide a routing destination based on the action and the object.

Referring now to FIG. 1, an illustrated communications system 100 that includes a call routing support system is shown. The communications system 100 includes a speech-enabled call routing system (SECRS) 118, such as an interactive voice response system having a speech recognition module. The communications system 100 includes a plurality of potential call destinations. Illustrative call destinations shown include service departments, such as billing department 120, balance information 122, technical support 124, employee directory 126, and new customer service departments 128. The system 100 includes a communication network 116 that receives calls from a variety of callers, such as the illustrated callers 110, 112, and 114. In a particular embodiment, the communication network 116 may be a public telephone network or may be provided by a voice over Internet protocol (VoIP) type network. The SECRS 118 may include components, such as a processor 142, a synonym table 144, and an action-object routing module 140. The SECRS 118 is coupled to and may route calls to any of the destinations, as shown. In addition, the SECRS 118 may route calls to an agent, such as the illustrated live operator 130. An illustrative embodiment of the SECRS 118 may be a call center having a plurality of agent terminals attached. Thus, while only a single operator 130 is shown, it should be understood that a plurality of different agent terminals or types of terminals may be coupled to the SECRS 118, such that a variety of agents may service incoming calls. In addition, the SECRS 118 may be an automated call routing system. In a particular embodiment, the action-object routing module 140 includes an action-object lookup table for matching action-object pairs to desired call routing destinations.

Referring to FIG. 2, an illustrative block diagram of the SECRS 118 is depicted. In this particular embodiment, the processor 142 in the SECR 118 includes an acoustic model 210, a semantic model 220, and an action-object routing table 230. The acoustic model 210 receives speech input 202 and provides text as its output 204, and the semantic model 220 receives text 204 from the acoustic model 210 and produces an action-object pair. The actions and objects in the action-object table are ordered or ranked according to a confidence level. A confidence level indicates how likely a particular action or object reflects a correct and useable caller instruction. A call can be routed based on the overall highest confidence level action and object in an action-object list in the routing table 230. An exemplary routing table is depicted in FIG. 5. Objects are listed on the left side of the chart and actions are listed across the top of the chart. If the action of “acquire” has the highest confidence level for an action and “DSL” has the highest confidence level for an object, then the call would be routed to extension “001” according to the intersection in the table of the action-object pair “acquire-DSL.” As a further example, when the action-object pair of “cancel-DSL” is selected, then the call would be routed to extension “002” according to the routing table. In the event that a confidence level for an action-object pair is below a predetermined level, the call may be routed to a human operator or agent terminal.

The routing table 230 can receive a selected action-object pair 206 from the semantic model 220 and produces a desired call routing destination 208. Based on the call routing destination 208, a call received at the SECRS 118 may be routed to a destination, such as billing department 120 or technical support service destination 124, as shown in FIG. 1. In a particular embodiment, the action-object routing table 230 may be a look-up table or a spreadsheet, such as Microsoft Excel™.

Referring to FIG. 3, an illustrative embodiment of a method of processing a call using an automated call routing system, such as the system of FIG. 1, is illustrated. The method starts at 300 and proceeds to step 302 where a speech input signal, referred to as an utterance, is received or detected. Information about the caller is also received, such as information that can identify the caller or the caller's account number. This information can be automatically provided or it can be requested using a voice prompt and received as part of the utterance or as touchtone input. Using phonemes, the received speech input is converted into a plurality of word strings or text in accordance with an acoustic model, as shown at steps 304 and 306. In a particular embodiment, confidence levels are assigned to word strings based on established rules and the content and coherency of the word string. Next, at step 308, the word strings are parsed into actions and objects. Objects generally represent nouns and adjective-noun combinations while actions generally represent verbs and adverb-verb combinations. The actions and objects are assigned confidence values based on how likely they are to reflect the intent of the caller.

Many possible actions and objects may be detected or created from the word strings. To aid in the selection of a particular action and object, a table such as a synonym table can be utilized, as shown in FIG. 4. The synonym table can store natural language phrases and map the phrases to a predetermined list of actions and objects. Textual conversion of the natural language spoken by the caller in the acoustic model can be compared to natural language phrases in the table based on the phonetic-to-text conversion. Using the synonym table, the system and method maps portions of the natural phrases to at least one action or at least one object.

Additionally, the synonym table can provide rules for distinguishing recognized actions and recognized objects from specific actions and specific objects. Generally, there may be numerous actions that map to the same destination and numerous objects that map to the same destination. For example, in FIG. 4, the recognized object “Named Services” can include specific objects, such as “Caller ID,” “Call Notes,” “Call Forward” and other features or services that a single call service department may process. Thus, when a call request creates a recognized action or recognized object from a class of possible actions or objects, the call can be routed based on the recognized and selected action or the recognized and selected object. The specific action or specific object can be stored and transmitted with the call such that a destination system can utilize the action or object to assist the caller and meet the caller's objectives.

Recognized actions and objects can be assigned a confidence level based on textual similarities between predetermined or known actions and objects and results of the phonetic conversion. Additionally, rules that are based on consistency between a recognized action or an object and known phonemes can also be utilized to determine the confidence level. The method selects a most probable action and object from a list of recognized objects and actions utilizing the confidence levels, as is illustrated in steps 310 and 312. The selected action and selected object are paired and a current routing destination is determined at step 314. The method can track the treatment of the call utilizing three variables known as prior routing destination, current routing destination, and future routing destination. The prior routing destination variable refers to a previous location having processed the call, the current routing destination refers to a present location that is processing the call, and the future routing destination variable refers to where the call may be routed for processing. The variables can be set and reset throughout the process, as the call is routed, possibly among call centers. Thus, after an initial action object selection, the variables may be as follows: future routing destination equals null; prior routing destination equals null; current routing destination equals action-object table selection results.

Using the caller information received, the method determines if the caller is violating or has violated any business rules at decision step 316. For example, if a caller is late on their payment, or has a voice message box that has been full for over two months, the method may determine that the caller has violated a business rule. When a business rule has been violated, the call can be routed to a destination or a call service department determined by the business rules, as shown in step 320. When the call is transferred to a call service department that can overcome the rule violation, the variable can be reset as follows: (future routing destination equals table results; current routing destination equals business rules table; prior routing destination equals null). The call is routed to the current routing destination at step 322 and the caller is encouraged to take action to comply with the business rule that has been violated. After an attempt has been made to comply with the violation, the variables are reset (prior routing destination equals current routing destination; current routing destination equals future routing destination; future routing destination equals null), at step 324. The method proceeds back to decision step 316 to determine if the caller remains in violation of at least one business rule. If the caller is not in violation of a business rule, then the call is routed to the caller's desired destination, at step 318, the call routing destination variables are reset, and the method ends at step 326.

The above-disclosed subject matter is to be considered illustrative, and not restrictive, and the appended claims are intended to cover all such modifications, enhancements, and other embodiments that fall within the scope of the present disclosure. Thus, to the maximum extent allowed by law, the scope of the present disclosure is to be determined by the broadest permissible interpretation of the following claims and their equivalents, and shall not be restricted or limited by the foregoing detailed description. 

What is claimed is:
 1. A method comprising; receiving a communication from a source, wherein the communication is associated with an account, and wherein the account is associated with the source; determining, at a processor, whether a time period satisfies a threshold time period, wherein the time period is associated with a voice message box, wherein the voice message box is associated with the account, wherein the voice message box is associated with a particular status, and wherein the particular status is associated with an available capacity of the voice message box; and routing the communication to a first destination based on a determination that the time period satisfies the threshold time period.
 2. The method of claim 1, wherein the particular status indicates that the voice message box is full.
 3. The method of claim 1, further comprising identifying the first destination based on an audio input signal received during the communication.
 4. The method of claim 1, wherein the communication is a voice over internet protocol call or a public telephone network call.
 5. The method of claim 1, further comprising routing the communication to a second destination based on a determination that the time period does not satisfy the threshold time period.
 6. The method of claim 5, further comprising, after routing the communication to the second destination: setting a first value of a first variable, wherein the first value is based on the second destination; and setting a second value of a second variable, wherein the second value is based on the first destination.
 7. The method of claim 1, further comprising, after determining that the time period satisfies the threshold time period, determining a payment status associated with the account.
 8. The method of claim 7, further comprising routing the communication based on the payment status associated with the account.
 9. The method of claim 7, wherein the communication is routed to a second destination when the payment status is an unpaid status.
 10. The method of claim 9, further comprising, in response to a change in the payment status, routing the communication from the second destination to the first destination.
 11. The method of claim 9, wherein the second destination is a billing department destination.
 12. A method comprising: determining, at a processor, a first destination based on a communication from a source, wherein the communication is associated with an account, and wherein the account is associated with the source; determining a time period that a voice message box is associated with a particular status, wherein the voice message box is associated with the account, and wherein the particular status is associated with an available capacity of the voice message box; and routing the communication based on a comparison of the time period to a threshold time period.
 13. The method of claim 12, further comprising routing the communication to the first destination when the time period satisfies the threshold time period.
 14. The method of claim 13, wherein the time period satisfies the threshold time period when the time period is greater than the threshold time period.
 15. The method of claim 12, further comprising: determining text based on the communication, wherein the text is associated with the communication; converting the text into an object and an action; and determining the first destination based on the object and the action.
 16. The method of claim 12, further comprising transforming speech communicated during the communication to generate text.
 17. A system comprising: a network interface configured to receive a communication from a source, wherein the communication is associated with an account, and wherein the account is associated with the source; a processor; and a memory that includes instructions that are executable by the processor to perform operations including: determining a time period that a voice message box is associated with a particular status, wherein the voice message box is associated with the account, and wherein the particular status is associated with an available capacity of the voice message box; and routing the communication based on a comparison of the time period to a threshold time period.
 18. The system of claim 17, wherein the memory includes an action-object routing module executable by the processor to determine a routing destination based on speech received during the communication.
 19. The system of claim 18, wherein a synonym table and a routing table are accessible to the action-object routing module. 